Cleaning Data Sets with Diagnostic Errors in the High-Dimensional Feature Spaces
نویسندگان
چکیده
منابع مشابه
Feature Selection for Small Sample Sets with High Dimensional Data Using Heuristic Hybrid Approach
Feature selection can significantly be decisive when analyzing high dimensional data, especially with a small number of samples. Feature extraction methods do not have decent performance in these conditions. With small sample sets and high dimensional data, exploring a large search space and learning from insufficient samples becomes extremely hard. As a result, neural networks and clustering a...
متن کاملProjective ART for clustering data sets in high dimensional spaces
A new neural network architecture (PART) and the resulting algorithm are proposed to find projected clusters for data sets in high dimensional spaces. The architecture is based on the well known ART developed by Carpenter and Grossberg, and a major modification (selective output signaling) is provided in order to deal with the inherent sparsity in the full space of the data points from many dat...
متن کاملFeature Subset Selection using Rough Sets for High Dimensional Data
---------------------------------------------------------------------***--------------------------------------------------------------------Abstract Feature Selection (FS) is applied to reduce the number of features in many applications where data has multiple features. FS is an essential step in successful data mining applications, which can effectively reduce data dimensionality by removing t...
متن کاملThe support feature machine - an odyssey in high-dimensional spaces
vii Zusammenfassung ix Acknowledgements xi
متن کاملOn high dimensional data spaces
Data mining applications usually encounter high dimensional data spaces. Most of these dimensions contain ‘uninteresting’ data, which would not only be of little value in terms of discovery of any rules or patterns, but have been shown to mislead some classification algorithms. Since, the computational effort increases very significantly (usually exponentially) in the presence of a large number...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Mathematical Biology and Bioinformatics
سال: 2019
ISSN: 1994-6538
DOI: 10.17537/2019.14.464